>tide is included in the target DNA region and is as indicated by t 
After PCR amrb.li I i il r 1 Ite i Hi 11 111 



subsequently be digested in the presence of the Fokl and Fspl restriction enzymes mtk 
; optimal conditions tor i ' dtant fragments after 

com* digestion, an S-iner and a 12-mer, n u depicted. In this figure, the 
polymorphism (A, in italic) is contained within the 12-mer fragment. 

Figure 2 I his fi„o e de[ lot i m - - iH a type US restriction enzyme, which 
- eJN « tdt t i 1 tin t 1 t 

q. ; „!.it^'ie.-d : 1 l s. dim' base ov. hana r ci d: <\ 5 ihuvei, td 

figure, by designing the primer so that the Fok I recognition site is located within 12 
bases or less of the 3' end of the primer one can assure that the Fok I cleavage will 
-i - Ii pmu |f e ^ 1 h,v,r.sthctui:il ; . ""'"r sit'stiths 

ion leaves blunt ends. The Fspl recognition site, TOCGCA, 



figure J. % this figure, the utility ofthsFsp KFo.' 
; entioni shown The Fspl recu lit nsiteove 
.I,.: i , be partially eennbined. 1 bus, iiieluibiie tin 
;n hi l 1 a! i ! ! ' ' I 1 e:i 



nl t i' « il i r n pin il i I i i i i 

:o the target DNA sequence. After amplification by PCR, the resuj: 
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, tin tlj ll Ii l l i 1 . 1' 1 I I 1 1-1 

he amplicon for subsequent digestion and mass specii nmen i t 

vn in this figure is incorporation of Bcgl, an 
ne that is capable of making two double strand cuts, one on the 5' side 
and one on the 3' side of their recognition site. The recognition site for Beg) is 
12/10(N)CGA<Jfl<;TGC(N)12/10, which after digestion results in fragm 
for mass spectsrometric analysis and identification of the polymorphic b; 



its sufficient 



vn in this figure is an example of 



in the present invention of 



a nick in the DNA amplic 
figure, a primer R is desi£ 
(GAGTCNNNN A NN) in 
i primer forms a hair-pin loop structure i 
however, the PCR amplicon has the .' 
with Fokl and NfBstNB I results in a 
base (Tin italic). St 




ration for one of the deoxyribonucleotides. A 
designed to contain a Fokl recognition site which upon hybridization with the target 
DNA sequence forms a hair-in loop. The primer also has a ribonucle.oside (rG) 
25 substitution which will additionally be incorporated into the amplicon.- The 

ribonucleoside substitution is base-labile and will cause a break in the backbone of the 
D> A at that site under basic conditions. Shown in this scheme, ne amplicon is 
incubated with the restriction are re 1 i inuble trand break. The 
amplicon is then incubated in the presence of base causing a break between the 
30 ribonucleotide G and the 3' deoxyrrbonucleotide T, releasing a 7 base fragment which 
can easily analyzed by mi 



Figure 11. The diagram illustrates the major approaches to haplotyping within the 
allele capture group of allele enrichment methods. As shown, methods can be broadly 
hose directed to single stranded DNA and (2) those directed to 
sIA. Ii is possible to capture DNA fragments in an allele specific 
manner by affinity to proteins or nucleic acids that discriminate single base differences 
I I t L p , I (affinity reagent re shown in the boxes. 



haplotypes. 

Figure 12. This diagram depicts 
specific amplification. After clea 
amplified, or separated by a size i 



15 Figure 13. This diagram depicts the 
haplotyping strategies based upon all 
is preferentially amplified fro 



haplotyping based on allele- 

| ii e t tl , 

he cleaved allele may be 



primers that exploi 



sequence differences at polymorphic 3it 



V: PCR 



Figure 14. Hair pin loop primers. In this figure the pri 
amplification are shown, fit allele 1, the polymorphic site is a T (italic) and 
incorporation of the ATCTGOA 5' portion of the primer occurs after at least one rount 
of amplification. In allele 2, the polymorphic site is also a T (italic)and incorporation 
of the ATCTGGA.5' portion of the primer occurs at least after one round of 
amplification, 



hlcic 1. -he pohno^hie site is a C (hade I and i-eoi |isaenori of the 
' portion of the primer occurs after at least one round of amplification. In 
., the polymorphic site is also a C (italic) and incorporation of the ATCCGGA 5' 
, of the primer occurs at least after one round of amplification. 

I , el I i ' 

itively, the minus strand of allele 2 is incapable of forming a hairpin loop due to 




FoUowing an amplification reacts „, 1 h, H ,m I 1 P I r 
incorporated into the amplicon - see sequence in box Figure 7), remaining micl< 
are removed using any of a variety of methods known in the art, such as spinning 
through a size exclusion column such as Sephadex O50 or by incubating with an 
alkaline phosphatase, e.g., shrimp alkaline phosphatase. The amplicon is then cl 
with the restriction enzyme (Fok I), which ge. 



polymorphic base. This 
10 polymorphic ^ am i: 



en be filled in with nucleotides in ' 



randingtooneofthepoi 
s modified nucleotide fT™ 1 in Figure 7). An example of such 
jxyuridine (BrdU) which is 64.8 Daltons higher in mass than 



ante in larger differences in mass between 
IS - fragments, making analysis, e.g., automated analysis, easier. 

fragment amenable for mass spectrometry analysis and identification of the 
1. Resulting DNA fragments can also be analyzed by 
retic detection methods. For example, DNA fragments 
mass modified nucleotides would show a different electrophoretic mobility 



i raphe - I - 



DN . 



site is allowed to hybridize to the target DNA forming a hair-pin loop, and subsequent 
amplification incorporates the Fokl site into the amplicon. The resultant amplicon is 
subjected to digestion with Fokl to separate the sequence 
and the residual nucleotides from the PCR reaction are r 
) The overhang sequence then is filled in with a polymerai 

nucleotides with one of the nucleotides of the polymorphic site being a 
dideoxynucleotide, or chain terminating nucleotide. Thus, differential fill-in of the 
overhang will be dependent on the presence or absence of the polymorphism and tht 



3' of the site of polymorphism 
moved as described above, 
e in the presence of natural 



primer is not labeled but the <ii(!eo>:y chain terminati 

the suspected polymorphic bases is labeled such that the fragment can be detected. 1 

preferred embodiment, each polymorphic base dideoxym Jeotide is labeled with a 
; uniquely detectable label and the identification of the polymorphic site is based upm 

presence of one signal and absence of another in the cases of homozygous or the 

presence of both signals in the cases of heterozygotes. 

In one embodiment, it may only be necessary to incorporate one restriction 

enzyme site into the amplicon via the primer. This can be done if the enzyme utilize 
i capable of making two double strand cuts, one on the 5' side and one on the 3' side i 

the recognition site. An example of such an enzyme is Beg I, n 

site of 12/10(N)CGA(N)STOC(N)12/10 (Figure 8). the arrows designate the sites c 



Another modification of the basic system is to use a third restriction enzyme that 
. cleaves only one of the two alleles, such that the presence of a polymorphic site yields 
shorter fragments than are observed in the absence of the polymorphic site. Such a 
modification is not universally applicable because not all polymorphisms alter 



rialh ad 



nucleotide. Only tht 



25 polymorphic site (in this example all th 
polymorphic site include foe cinacieoli 



GC),th 



sing the number of 



u nzyme Ital. ni,*iheD 
uising a double strand break) are used. One such enzyme is jN.BstNB I whose 
.■cognition site is GAOTCNNKN'NN. The iragnwr.ts generated by this scherrj 
utlined in Figure 9. This strategy would generate only one small fragment (10- 
lis case) instead of two, making analysis even more amenable to automation. A 




conditions that allow allele-specific hybridization. Optimally, the oligonucleotide is at 
least 10-100 nucleotides in length, and the hybridization is sufficient to withstand 
subsequent manipulations of the oligonucleotide/DNA complex. This complex then is 
subjected to conditions that will allow cross-linking of the oligonucleotide with the 
genomic DNA. The sample of DNA containing both the modified and unmodified 

protected allele-enriched DNA. 

In a preferred embodiment, binuclear Platinum (H) (PtH) complexes (figure 39) 



ue 'l:,ioiP:v 



licDNA. * 



osslinking an oligonucleotide coupled to a PHI to a 



complementary oligonucleotides. The thio oligonucleotide (10 picomole in 1 /iL) was 
incubated with 0.5 (tL of 0.1 mMKBrLt, 2 (lL of 1 mM phosphate/0.1 mM EDTA pH 
7.4, and 0.5 ph of 10 liU binuclear platinum (It) complex for 90 minutes at 37°C. The 
complementary oligonucleotide (0.01 picomoles in 0.5 (tL) was heated to 60°C for 3 
minutes and added to the above thio oligonucleotide mix. 0.5 uL of 0.5 M NaCIO, was 



at 37'C f« 



;au::t;ci:y by i lug that i h i jo J not i r bl 
a 5' OH replacing the 5' thio or with ah oligonucleotide wi 



Type I snake venom phosphodiesterase in 0.11 M Tris.HCl/Nacl pH 8.8, 15 mM MgClj 



h: sit.- of a riUcinssi-.uk at .vhirh pci--.. -he li 
The above experiments by Gruff et al. 
;ould be modified by i 



<fe\s:lo!> -i linplotypir-g pn-.-cfltn.- rtmc 'he IV! 



„i.n; .n jolymorpliisiii (figure 40, allele 1). Binuclear 
platinum (H) (PtH) is coupled to this oligonucleotide through the thio group using the 
procedure described by Gruff et al. or a similar method. The PHI coupled 
oligonucleotide could be used directly or the excess uncoupled Pffl may be removed by 

uncoupled Ptn may reduce nonspecific background adduct formation. It also may bo 
possible to find a method of oligonucleotide synthesis that will directly label the 
oligonucleotide dunn„ ' 1 1 ing the labeling and purification steps. 

A second oligonucleotide without the thio group is also designed. This 



the variance where it has the base corresponding to the other allele (figure 40, allele 2). 

is ri nil" 4 t ii ' * 

targeted site of variance and allowed to hybridize. The PtH coupled oligonucleotide 
hybridizes to the allele to which it is perfectly matched (allele 1) and the other 
oligonucleotide hybridize to the other allele to which it is perfectly matched (allele 2). 
The Pffl coupled oligonucleotide is then chemically crossUnked to the target DNA. 

20 il ls crosslinking protects this allele of the target DNA from degradation by 

exomicleases. Exonuoleases which are known to degrade single and double stranded 



be genotyped by any method which is ca >able o 
plate. Because there is only one allele left in the 
determination of the haplotype for this allele. 



-ne possible DNA mollifying agent. Trans- 
platiitum (II) diammine dichloiidc lias beer, shown to crosslink DNA when attached to 
an oligonucleotide (Chu BC, Orgel LE, DNA Cell Biology, Vol 9, pp. 71-76, (1990). 

f , Bioconjugate 

Chemistry, Vol 1, pp. S2-8S, (1990)). The method is not limited to the reagents listed 
above and should wo* with any exonuclease blocking agent which can bo specifically 
iraetecrooiiealH. 11. ni i ei i 1 > r - -*.hipi[H eicacrdfPNA) 
molecules can also be used F a t i wn to sequence pecifically hybridize to 

, , l h fbmdi i It 

DNA and blocking exonuclease activity. Two such agents are triple helices and 



cries in a Soahera Mot (hybridized wi 



ill, oUre-lanas, corresponding ro 
te, homozygotes ro: the allele 
the mi il 1 'hi >h " ' 



rits lacking the polymorphic i 



wilh the restriction site depends t I tan from II ] It r i t 1 ti 
I | I 1 1 ln 1 7 

In the past the locadon of polymorphic restriction sites and the sizes of the 
r „ Hindu I nr 1 roamed empirh. ally- Although many 

restriction -ite polymorph v-rs na - been eonverted toPCR assays by designing 



DNA. Genotyping of the allele 
enriched DNA sample, can proeeee by a rrerhod known to one skilled in the art 

t 1 i ltu|.-dt • 1 i | Il 1 a 

sequencing, allele-specific oligonucleotide hybridization and sequencing (ASO), ar 
i a method de cnbcdi xMet iyr.mg Polynucleotides", US. serial num 

4 + i I - 8 filed September 9, 1999- As one 



rene-ions anil procedure 



rype ei il i i hi r " re fa alt',- hi..i del 
ictton fragment length polymorphisms (Rl'LI'si. 
i but not necessarily SNPs, that affect res' 



tod, RFLPs can be used to produce long range 
kb, frequently over 10 tt> and in some instance 



re IS. is 



(il) Id t- a r f eirher in the rearnrenr, or. oreierabK 
harreirrr DNA, rent airerr, i 1 enavrrie rewire: r,ra: rirerur a. ear; eric 
, d; , (liK1 [Tin, i i i I > Hi. nr bounds of the segmer 

haplotyped are defined by the nearest occurrence of RE 1 on either side of th< 
polymorphic site.; 

(iii) Prepare genomic DNA from samples that are heterozygous for ft 

, , , i , r i 

) (iv)Restn the r na ithlli. n Mil '1 itrecog la 

polymorphic site; 

(V) separate the restricted DNA usin;; any DNA size fractionating mi 
, mil g r, r iiagnrent interest Exemplary 



ae larger restriction fragment and, 



chromatography, e.g., sephadex of 

(vi) Isolate a first DNA ftactii 
optionally, a second DNA fraction coi 
necessary, purify DNA from each fraction for PCR. It is not necessary that the 
fragments be highly enriched in die fractions, only that each of the one or more D 
fractions contain a significantly greater quantity of one allele than of the other. A 
minimum differential allele enrichment that would be useful for haplotyping is 2: ' 



isl 5:1 aa 



■Jy 10:! ra- 



10 (vii) Genotype the polymorphic sites of interest in either one of 

(the one enriched for the larger allele or the one enriched for the smaller allele), or, 
optionally, determine genotypes separately in both size fractions. Since each fraction 
contains principally one allele, the genotype of the fractions provides the haplotypes of 
the enriched alleles. If only one fraction is genotyped, providing one haplotype, then 

15 the other haplotype can be inferred by subtracting the determined haplotype from the 
genotype of the total genomic DNA of the samples of interest. In a haplotyping project 
it is desirable to determine the genotypes in total genomic DNA of all samples of 
interest in advance of the haplotyping project, in order to determine, first, which 
samples actually require haplotype analysis (because they contain two or more sites of 

20 heterozygosity in the segment of interest), second, which samples are heterozygotes at 
the restriction site polymorphism selected for separation of the alleles by size, and are 
therefore suitable for analysis by the above method; third, the genotype of the total 
sample constrains the possible haplotypes, and provides a check on the accuracy of the 
. haplotypes. Preferably the haplotype of both alleles are determined separately and 

25 compared to the genotype of the unfractionated sample. Samples that are not suitable 
for haplotype analysis with one restriction enzyme (because they are not heterozygous 



Restriction endonuclease sites that flank the target segment can be exploited to 
30 produce optimally sized molecules for allele selection. For exar 
DNA sample can be restricted so as to produce two allelic DNA 
length (and perhaps also differ from one another by the presence 
binding sile for an allele specific binding reagent). Because of I 



a I i c 1 it ' hi ii t n de the target Ufl/ 

segment to be haplotyped (thereby producing the maximal size DNA fragment in: 
differs in respect to the presence/absence of a single binding s te), comp'cte result 
is a preferred method for controlling the size of DNA segments prior to allele 



I a | prr scheme that does not require a sra, 
rated in Figures 19 and 20, the initial steps are as 
rgment to be haplotyped (the exact boundaries wit 



endonuclease (RBI) (Bam HI in this example). The outer bounds of the segment to be 

15 polymorphic site; 

(iii) identify a second restriction endonuclease (RB2) (Nhe I in Figure 19) that 
cleaves only once within the segment to be haplotyped; 

(iv) prepare genomic DNA from samples that are heterozygous for the 
polymorphism identified in step ii. It is desirable that the average length of the genomic 

20 DNA be greater than the length of the DNA fragment being haplotyped; 

(v) restrict the genomic DNA with RE1; 

I , l r i In- 

blocking step can be performed by, e.g., selecting an RBI that produces termini not 
susceptible to exonuclease digestion - for example 3' protruding termini are resistant to 



in U.S patent application serial number 09/394,774 filed September 9, 1999, entitled A 
METHOD FOR ANALYZING POLYNUCLEOTIDES); or by ligaung adapters with 
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